Towards Extremely Fast Context Switching in a Block-Multithreaded Processor

نویسندگان

  • Winfried Grünewald
  • Theo Ungerer
چکیده

Multithreaded processors use a fast context switch to bridge latencies caused by memory accesses or by synchronization operations. In the block-multithreaded processor – called Rhamma – load/store, synchronization and execution operations of different threads of control are executed simultaneously by appropriate functional units. A fast context switch is performed, whenever a functional unit comes across an operation destined for another unit. Switching contexts on each load/store instruction sequence allows a much faster context switch in the execution unit than previously published designs do. The results show the potential of multithreading to spare expensive off-chip cache in a workstation environment. The load/store unit proves as the principal bottleneck. In particular the memory cycle time is performance critical. We show that multithreaded processors profit more than conventional RISC processors by a shorter memory cycle time.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Preliminary Performance Study of Architectural Support for Multithreading

This paper discusses the preliminary performance study of hybrid multithreaded execution model that combines software-controlled multithreaded system with hardware support for efficient context switching and threads scheduling. The hardware support for multithreading is augmented with a software thread scheduling technique called set scheduling, and their benefit to the overall performance is d...

متن کامل

MSparc: A Multithreaded Sparc

This paper presents a multithreaded processor, the MSparc. MSparc supports up to four contexts on chip and employs block mul-tithreading. The processor is compatible to standard Sparc processors making multithreading completely transparent to application programs. Switching can be done by hardware or software and is achieved within one processor cycle. Preliminary performance evaluations in a N...

متن کامل

A Practical Processor Design For Multithreading

High speed message handling is one of the most important problems for e cient multithread processing. We have proposed a processor architecture called DatarolII, that promotes the e cient ne-grain multithreaded execution, by performing fast context switching among ne-grain concurrent processes. We are developing a prototype multithread machine KUMP/D (Kyushu University Multi-media Processor on ...

متن کامل

Exploring the Cache Design Space in a Multithreaded Processor

Multithreading can be used to hide latency in a non-blocking cache architecture. By switching execution form one thread to another, the CPU can perform useful work, while waiting for pending requests to be processed by the main memory. This frequent context switching however, produces a very irregular memory referencing pattern. In this paper we examine the effects of associativity and block si...

متن کامل

Partitioning a lenient parallel language into sequential threads

Multithreading as attractive in a large-scale parallel system since it allows split-phase memory operations and fast context switching between computations without blocking the processor. Performance of multithreaded archifectures depends significantly on the quality of multithreaded codes. In this paper, we describe the enhanced thread formation scheme to produce efficient sequential threads f...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996